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Abstract 



Using high frequency data, we have studied empirically the change of volatility, also called volatil- 
ity derivative, for various time horizons. In particular, the correlation between the volatility deriva- 
tive and the volatility realized in the next time period is a measure of the response function of the 
market participants. This correlation shows explicitly the heterogeneous structure of the market 
according to the characteristic time horizons of the differents agents. It reveals a volatility cascade 
from long to short time horizons, with a structure different from the one observed in turbulence. 
Moreover, we have developed a new ARCH-type model which incorporates the different groups 
of agents, with their characteristic memory. This model reproduces well the empirical response 
function, and allows us to quantify the importance of each group. 
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1 Introduction 



Financial markets are very interesting self-organized structures. On a given market, say for ex- 
ample the foreign exchange (FX) market for USD/CHF, a large number of agents are present. 
These agents differ by their geographic locations, individual preferences, economic expectations, 
information sets, present market positions, educations, risk aversions or professional constraints. 
Yet, at a given time point, the market agree on one thing: a price. An interesting categorization 
of the market participants can be found in their characteristic time frame: intra-day speculators 
and market makers, daily traders, portfolio managers rebalancing their positions every week, or 
pension funds and central banks that are active at a scale of months using fundamental economics 
measures. Although this categorization makes sense, until now it has gone unobserved. Essen- 
tially, the only endogenous information available about a financial market is the resulting price p 
as a function of time t, and the curve p{t) looks like a random walk. 

A related topic is the efficient market hypothesis (EMH) [1,2]. This hypothesis can be formulated 
in many different ways, with various strengths. For example, a (semi) strong formulation can 
be "given all the publicly available information, the price process is a martingale", and a weak 
formulation could be "there are no dependencies in past price changes that a technician could use 
to predict future changes". There exists a huge body of literature on this topic, with many empirical 
tests of particular formulations of the hypothesis. This hypothesis is rooted in the rationality of 
the market participants: humans are rational and behave in their best interests. Within a strong 
formulation of the EMH, given (all) the information at time t, each market participant should 
behave in the same rational way, and the market should incorporate "instantaneously" every new 
information to reach a new equilibrium price. This implies that the market participants behave as 
one group, a picture quite different from the time characterization given above. On the other hand, 
research on the microstructure of the FX market conducted by questionnaires survey of dealers [^] 
indicates a heterogeneous set of time horizons, and the practical importance of technical analysis. 

Another piece of evidence related to the market composition is the recent analogy with fully de- 
veloped turbulence [ ]. These authors have compared the probability density function (pdf) of the 
return (i.e. price changes) for a set of time horizons ht with the pdf of velocity differences in a fluid 
for a set of position differences. The striking agreement of the pdf 's leads to the conclusion that 
the vorticity cascade responsible for turbulence should have a counterpart in financial processes. 
Therefore, an information cascade must be present in financial market, from long time horizons 
up to intra-day traders. 

The basic question underlying the above points is the homogeneous or heterogeneous composition 
of the financial market, as well as the possible different agent's time responses and mutual inter- 
actions. Beside some indirect evidences and arguments, this is essentially an open question. An 
indirect evidence for the mai^ket structure has already been obtained in [ ^] thr^ough a log-likelihood 
estimate for the HARCH model (with an induced market structure in agreement with the results 
below). What is missing is a statistical estimate, derived from the the price process p{t), that 
is able to display the underlying structure of the market participants. By studying the volatility 
time derivative, we have found such a quantity in the coiTclation between the change of volatility 
and the realized volatility, thus providing a visual proof of a heterogeneous market structure. The 
paper proceeds as follows: the next section will introduce the definitions we are using. Then, we 
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will present empirical results, followed by a model for the volatility process that incorporates the 
structure of the market. 



2 The volatility and its time derivative 

Our data processing starts with tick by tick quotes obtained from Reuter. These quotes arrive at 
random time t and contain a bid and ask price. From the quotes, we compute the logarithmic 
middle price x{t) = 0.5 (ln(bid(?)) +ln(ask(f))). These raw mid-prices are then smoothed by a 
very short term moving average with a range of 3 minutes in order to eliminate the tick by tick 
noise. 

High frequency data contain very strong intra-day and intra-week seasonalities, namely a pre- 
dictable repetitive pattern due to the daily and weekly cycles of human activity. These seasonal- 
ities are filtered out by doing the computations in the proper business time scale. The key idea 
is similar to the usual business time scale used when working with daily data, namely to simply 
omit the week-ends and major holidays from the computations. The time scale we are using is an 
improvement along this idea, namely to expand periods of high activity and to contract periods of 
low activity (week-end, night). The seasonal activity pattern of high frequency data is measured 
on a moving sample, and the dynamic time scale is constructed by integrating the activity. The 
time scale is normalized such that, on average, a time interval of 5t in physical time scale is equal 
to bt in business time. The basic ideas are presented in [6], and the dynamic algorithm we are 
using is explained in detail in [7], including the discounting of holidays and the treatment of day- 
light saving time. Let us emphasize that the proper discounting of the seasonalities is a mandatory 
preliminary step in order to obtain the results presented in this paper. 

Given the dynamic time scale, we compute a regular time series x{i) of smoothed prices where the 
sampling is done every 5? = 10 minutes on the dynamic time scale. From this regular time series, 
the historical volatility is computed with 

al[5t„5tr]ii) = - I r^mU) (2) 

" i-p+l<j<i 

with dt,. = dt, 6?o = pht,n = Y,i-p+\<j<i- The denominator in eq. 1 "annualizes" the return, namely 
discounts the random walk scaling such that the expectation E [ p- [ht,] ] is essentially independent 
of htr, with a typical value of 10% for FX rates. The reference time interval Ar^f is taken to be 
one year. The volatility derivative a is computed using a smooth difference kernel according to 
[8] applied on the historical volatility 

a [6?A , 6?a ,htr]=^ [5?A ; Oh [hta ,htr]]. (3) 

The operator A essentially computes a finite difference A[8fA;z](0 — z{t) —z{t — 5?a), but using a 
convolution with a smooth kernel instead of the difference of pointwise values. Let us emphasize 
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that the notation a for the volatihty derivative is indeed referring to a finite difference at a time 
scale 5?A- This smooth derivative at a finite time scale is the appropriate notion of derivative for a 
random process, namely it measures the mean change at a time scale 5t^. For this article, in order 
to reduce the dimension of the parameter space, we restrict ourselves to 5?a = 5?o and 5t, = bt, but 
other choices lead to similar results. With this choice for the parameters, we can use the shorter 
notation a/,[5?(j] = Oh[btc,dt] and o[dtA\ = CT[6fA,5fA,5f]. In the finance literature, the analysis of 
the volatility derivative is new, as researchers have focused until now on the return and volatility. 
The volatility derivative a is a particularly interesting quantity as it measures dynamical aspects 
of the volatility evolution. A full analysis of the statistical properties of the volatility derivative is 
presented in [ ]. 

The historical volatility and volatility derivative at time t are computed using information in the 
past up to time t. The realized volatility corresponds to the "next" volatility after t, namely is 
computed from prices in the future of t. Using a forward time ti"anslation operator T [dt;x\{t) = 
x{t + ht), the realized volatility is 

Or[^t^] = l[hta;Oh[K]] (4) 
and we have again restricted ourselves to htr = ht. 

We have also explored other definitions for the volatility and the derivative. The volatility can be 
defined as an aggregated volatility with r[htr\ (/) = {x{i) — x{i — k)) / ^J 5?,/A7]ef with htr = kht. The 
derivative can be taken with a logarithm, namely a[5?A,5?(j,5fr] = A [6fA;ln(a/,[5fcj,5?,-])]. With all 
these definitions, very similar results are obtained, both for the empirical and simulated correlation. 



3 The market response function 

We have computed the usual linear correlation between the volatility derivative a[5?A] and the 
realized volatility ar[5?a]> for time intervals ranging from 4 hours to 42 days. This correlation 
measures the response function of the market to a change of volatility, similar to the phenomeno- 
logical susceptibility introduced in electro-magnetism with matter for example. As the market 
participants react to changes of volatility at a given time scale, they may change their positions 
and induce volatilities in the next time period. The correlation p ( 5?a , ) = p [a [5?a] , cr^ [6?o] ] mea- 
sures this response function. 

The computed correlation for 10 years of USD/CHF is displayed in Fig. 1, and clearly shows 
different groups of market participants. At short time scales, intra-day traders quickly react to 
short term change of volatility. However, short term volatility changes do not induce a response 
from traders with longer time horizons. Changes in the volatility at the daily time scale trigger 
the response of both intra-day and daily ti'aders, but not of the weekly and longer horizons market 
participants. Notice the gap between 8 hours to 1 day, corresponding to the absence of traders 
working inside this time frame. Then, slow change in the volatility induces a response of all 
market participants working at shorter time scales, while the maximal response is at a similar- time 
horizon. The correlation is always positive, which means that the mai^ket participants react to an 
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Figure 1: The empirical correlation between volatility derivative and realized volatility 
p[a[5?A],Cf,[5?a]] for the foreign exchange USD/CHF. The business time intervals corresponding 
to the axis label k are given by 8? = I'^l^ 4 hour, and span 4 hours (k=0) to 6 weeks (k=32). The 
data sample used to compute the correlation ranges from 01.01.1990 to 01.11.2000, and the dy- 
namic time scale and volatility are initialized using data from 1.7.1988 to 31.12.1989. 

increase of volatility by changing positions (and therefore they increase the realized volatility), but 
they are not likely to react to a decrease of volatility. Overall, the pattern that emerges is similar 
to a volatility cascade from low frequency to high frequency [ ], but with changes in volatihty 
triggering the response of all shorter time horizons. This is different from the picture in turbulence 
where the vorticity at a given time scale is related to vorticity only at nearby time scales. 

For other currency pairs or for stock indexes [^J], a similar structure emerges. However, there are 
quantitative differences, the most important one being a smaller cluster coiTcsponding to intra-day 
traders for stock indexes. This can be understood from the higher cost of trading stocks (brokerage 
costs and larger bid-ask spread), making it less profitable to trade intra-day. Finally, the con^elation 
between historical and realized volatility can be computed p[a/,[5?cj],CT,.[5?^]]- This correlation is 
dominated by the heteroskedasticity of the financial market, namely by the long memory (or the 
clustering) of the volatility. A finer structure due to the market components lies on top of the 
overall heteroskedasticity, but the structure of the market does not appear clearly in the volatility- 
volatility correlation. It is only the response induced by changes in volatility that reveals the 
components of the market. 
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4 Modeling the market components 



In order to fully understand the above picture, it is interesting to compare the empirical correlation 
with the one obtained with Monte Carlo simulations of theoretical processes. The simplest theo- 
retical model is an i.i.d. random walk. This model has a zero correlation p(5?A,5?o) = 0. A better 
benchmark, widely used in finance, is the GARCH(1,1) model [1 1]. A Monte Carlo simulation 
for this model shows a positive correlation, with one weakly defined maximum located around 
the correlation time of the process. This is clearly inadequate to model a market with sharply 
defined components. We have developed a new model, called Market-Component-ARCH(«) or 
MC-ARCH(«) model, in order to reproduce a market with n components. Structurally, this model 
draws from GARCH(1,1), the long memory model presented in [12], and the HARCH model 
[5, 13]. It is built using iterated exponential moving averages that induce a sharp cut-off for the 
memory of each component [8]. The MC-ARCH model equations are as follows: 



x{t + ht) 
r{t + ht) 

ols{t + ht) 



aeff(? + 5?) £{t + dt) 



We 



k=i 



MA[x;t,"i;''^](0 



(5) 
(6) 

(7) 
(8) 



with the constraint on the coefficients 



n 

Woo + £w4 = l. (9) 



The time interval bt fixes the time scale at which the process is defined. Eq. 5 says that the log- 
arithm of the price x = ln{p) follows a random walk with price increment r. From eq. 6, at each 
time step, the return r is the product of a magnitude Geff and a random variable £. The random 
variable e{t) is independent and identically distributed (i.i.d.), with the conditions £"[£(?)] = and 
E[£^{t)] = I. For the simulations, we have taken a Student-t distribution with v = 5 degree of free- 
dom, a number consistent with an estimate obtained through a maximum likelihood optimization. 
The magnitude aeff(? + 5f) can be seen as a forecast for the effective volatility of the market at 
t + 5t. This forecast is build using the information available at t (eq. 7). The constant a, with the 
constraint 9, fixes the mean volatility of the process, namely E[r^{t)] = E[olff{t)] = o^. The mean 
volatility is the volatility measured at an infinite time scale, and therefore its amplitude is denoted 
by Woo. The volatility Ok is measured by a moving average (MA) at the time scale of the squai^ed 
returns (eq. 8). Essentially, this term models the perceived current price volatility for a market 
participant with a memory of depth x^. in the past. The volatility contributes with a weight Wk 
to the effective volatility (eq. 7). The model parameters are x*:, Wk and a. The MA operator [ ] for 
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the time range x acting on the time series z is defined as 



MA[x,m;z] (t) 



EMAi(?) 
EMAj{t) 



/jEMAi(?-50 + (1-^)z(0 
^lEMAjit - 5?) + ( 1 - n)EMAj^i{t) 
exp(- 5?(m+ l)/x) 




(10) 



(11) 
(12) 

(13) 



with the shorthand notation EMAy = EMA[x, The coefficients (eq. 13) is computed from 
the time horizon x, so that the memory length of the MA operator is x. Technically, the memory 
length is twice the range of the kernel of the corresponding MA operator [8], a measure appropriate 
for rectangular like kernels. The MA operator is computed thr^ough a sum of iterated exponential 
moving average (EMA) (eq. 1 1 and 12). The coefficient m controls the shape of the decay for the 
memory, from exponential (m = I) to rectangular (m — > oo). Practically, m = 32 is already close to 
a rectangular memory. 

The structure of the MC-ARCH model is similar to the MARCH model [S, 1 ^] in that both include 
several volatilities measured on a set of time horizons. Yet, the HARCH model was developed 
mainly to include the asymmetry in the response function of the volatility measured with returns 
at different time horizons r[5tr], whereas we find this effect to be quantitatively unimportant. On 
the other side, it is important to have the proper time horizons for each market component, as 
well as the correct memory decay for the volatility measure, features not contained in the HARCH 
model. 

The parameters of the model have been optimized by simulations so as to reproduce the empirical 
figure for the coiTclation p[a[5?A],CT,-[5?cj]]- Good results are obtained by taking 5 components with 
characteristic times (measured in business days) x^ = 0.18 (intra-day), 1.4 (1 day), 2.8 (2 days), 
7(1 weeks) and 28 (4 weeks). The correlation obtained by simulation is given in Fig. 2 and the 
agreement with the empirical correlation is excellent. The coefficient m controlling the shape of 
the volatihty kernel has to be taken high enough, for the figure m = 64. For m = 1 , the shape of 
simulated correlation is too "soft", as it does not show the empirical abrupt drop to zero or the 
separation between intra-day and daily traders. This large value for m can be interpreted as an 
abrupt decay of the memory of the corresponding market component, namely the actors forget 
quickly the past beyond their characteristic time scale. The coefficients for each component Wk 
are respectively 0.39, 0.20, 0.18, 0.12, 0.11, and Woo = 0.00025. If we interpret these coefficients 
as measuring the "financial weight" of the respective component, we see that the largest fraction 
of the FX market is carried by short term dealers (intra-day, daily). Quantitatively, the actors with 
a characteristic time horizon up to two days account for 76% of the market. 

The MC-ARCH model does not contain an explicit term with a. A "pure" volatility model with 
the correct market structure is enough to reproduce the main feature of the empirical correlation 
p[a[5?A],CJr[5fa]]- Yet, we are not able to reproduce the sharp "valley" between intra-day and daily 
horizon, nor the decay of the conelation below the diagonal in the realized volatility direction. 
Possibly, a a term can be added in the MC-ARCH model in order to better match the empirical 
correlation, hence opening a whole new space of models. 
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Figure 2: The correlation between volatility derivative and realized volatility p[a[8?A],cTr[8fa]] for 
the l\/IC-ARCH(5) model, computed by Monte Carlo simulations. The business time interval cor- 
responding to the axis label k are given by 5t = l^l'^ 4 hour, and span 4 hours (k=0) to 6 weeks 
(k=32). The labels on the backdrop correspond to the characteristic time of each component of 
the MC-ARCH model, expressed in physical time (for time intervals shorter than a week, a factor 
5/7 is used to map business time intervals to physical time intervals in order to discount for the 
week-end). The length of the simulation is 10^ steps, corresponding to 19 years. 



8 



5 Conclusion 



The coiTelation between the change of volatility and realized volatility gives a picture of the market 
components and their responses. The pattern that emerges is that a change of volatility at a given 
time scale triggers a response, and therefore volatility, at all shorter time scales. The response 
function is clustered around values corresponding to well defined group of market participants, 
like intra-day dealers, portfolio managers or pension funds. This picture is a bit different from 
fully developed turbulence where the vorticity cascade relates nearby scales: the turbulence at a 
given scale is feed by the scale right above and feeds the scale right below. Moreover, the cascade 
is homogeneous. In financial markets, a change at a given time scale feeds all the shorter time 
horizons, and the structure is heterogeneous. 

The MC-ARCH(«) model presented here incorporates the relevant market structure, and has the 
same response function as observed in empirical data. It allows us to quantify the importance 
of each group, and shows that the agents quickly forget the past beyond their characteristic time 
horizons. Finally, to our amazement, from the apparently random walk of the price, we are able to 
extract only by statistical means a clear picture of the market heterogeneity. 
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